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*5i Abstract 

The dynamics of technological, economic and social phenomena is controlled by how humans 
O 



organize their daily tasks in response to both endogenous and exogenous stimulations. Queueing 
theory is believed to provide a generic answer to account for the often observed power-law distri- 
butions of waiting times before a task is fulfilled. However, the general validity of the power law 
and the nature of other regimes remain unsettled. Using anonymized data collected by Google 
at the World Wide Web level, we identify the existence of several additional regimes character- 
izing the time required for a population of Internet users to execute a given task after receiving 



a message. Depending on the under- or over-utilization of time by the population of users and 



C — ' the strength of their response to perturbations, the pure power law is found to be coextensive 

O . 

with an exponential regime (tasks are performed without too much delay) and with a crossover to 

an asymptotic plateau (some tasks are never performed). The characterization of the availability 

•i-H . 

and efficiency of humans on their actions revealed by our study have important consequences to 
understand human decision-making, optimal designs of policies such as for Internet security, with 
spillovers to collective behaviors, crowds dynamics, and social epidemics. 
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I. INTRODUCTION 



Consider a typical individual, who is subjected to a flow of solicitations. In the presence of 
time, energy, regulatory, social and monetary influences and constraints, such an individual 
must set priorities that will probably trigger a delay in the execution of task related to 
one message. Recent studies of various social systems have established the remarkable fact 
that the distribution Q(t) of waiting times between the presentation of the message and 
the ensuing action has a power law asymptotic of the form Q(t) ~ l/t a , with an exponent 
a often found smaller than 2. Examples include the distribution of waiting time until a 
message in answered in email [l|] and mail |2[ correspondence, and in other human activity 
patterns, like web browsing, library visits, or stock trading [3j. 

A related measure concerns the response of an individual, social or economic system, 
measured by a rate R(t) of activity, following a perturbation, an announcement or a shock. 
When the message is delivered directly and simultaneously to a large number of actors, the 
measure of activity in this population can be shown to be proportional to the distribution 
Q(t) of waiting times (in the absence of word-of-mouth effects) [4|, or nonlinearly related to 
Q(t) (in the presence of social influences) [7(]. Such measures of activity following a shock 
have also been documented to decay extremely slowly, approximately like a power law of 
the time counted from the shock 26j. For instance, measures of media coverage after a large 
geo-political event (such as the assassination of the Prime Minister of India on October 31, 
1984, the coup against Gorbachev in August 1990 or the terrorist attacks on September 11, 
2001) decay approximately as a power law of time since the occurrence of the event [9j. The 
rate of downloads of papers or book sales from a website after a media coverage also follows 



a power law decay 10 
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"he relaxation of financial volatility after a peak follows a power 
law, with different regimes 14J. An alternative measure of the number of volatility peaks 
above some threshold also have been found to exhibit this power law decay 15|, called the 
Omori law in analogy with a similar behavior of the rate of aftershocks following a large 
earthquake The study of the rate of video views on YouTube has confirmed the existence of 
power law decay after peaks associated with media exposure |8j, on a very large database of 
millions of videos. The dynamics of visitations of a major news portal (5|, and the decay of 



popularity of Internet blog posts 



6| also follow the same power law decay. 



The following section 2 presents an economic model of priority queueing processes, whose 



main point is to rationalize the empirical observation that systems are most often than not 
found to operated close to the critical point of the priority queueing model. The attraction 
to the critical point is explained in terms of the competition between improving the utility of 
time allocation and the cost of time management. A slight extension of the priority queueing 
model allows to account for behavioral biases and heterogeneity of agents. Section 3 then 
presents the data set and the results of the calibration of the empirical data on browser 
updates by the model. Section 4 concludes. The appendix provides relevant information on 
the data collection process. 

II. ECONOMIC MODEL OF PRIORITY QUEUEING PROCESSES 
A. Introduction to priority queueing models 



Such power laws have been rationalized by "priority queueing" models 16l-ll9|. in which 
tasks enter the queue following some stochastic process and are addressed in order of their 
priority. The key control parameter of these priority models is the "time-deficit" parameter 
/3, defined as the difference between the average time (r) needed to complete a task and the 
average time interval (77) between task arrivals. 

For (3 < 0, all tasks are eventually completed while for (3 > 0, there is a strictly positive 
probability for the low priority tasks never to be done. The value = is a critical 
point of the theory for which the cumulative distribution of waiting times exhibits an exact 
asymptotic power law tail Qp=o(t) ~ l/t a with exponent a = 0.5. In the extremal version 



where a task is singled out with the lowest priority 20], deviations of f3 from can be 
classified into two opposite regimes: for (3 < 0, Q(t) develops an exponential tail; for [3 > 0, 
Q(t) crosses over to a plateau at < Qoo < 1, equal to the probability that the low-priority 
task is never completed. 

Priority queueing models assume an exogenously determined mapping between the in- 
flow of tasks and their priorities, which determine the ordering of their fulfillment. In 
reality, there are certain tasks that may modify the whole task-solving process. We model 
this phenomenon by considering that the time-deficit parameter (3(t) is actually a dynamic 
variable, determined from the interplay between the incentive for agents to maximize the 
utility of their time and the costs of adaptation to the stochastic flow of tasks. 



B. Mapping of priority queueing onto consumption economic theory 

In general, individuals tend to maximize their utility given their available resources (which 
include the issue of time allocation). However, the essential difference between the standard 
consumption optimization and tasks allocation is that time is a limited and non storable 
resource. We proceed by analogy with the standard utility theory of decision making, in 
which a representative agent has to optimize her consumption flow over her lifetime dis- 
counted by her pure rate of time preference, given the intertemporal budget constraint that 
the present value of future consumption equals the present value of future income [27] . Let 
us now consider the following analogy: 

available wealth W —J- available time T (1) 

consumption spends C units of wealth — > solving a task consumes time r 

utility from consuming C : U{C) — > gain from solving a task taking time r : U(t) (2) 

total utility of consuming over T periods : — > total utility from solving N tasks during time T (3) 

T N 

U = Y,u(ct)e- pt dt -> U = Y,u(7n) (4) 

T T T 

budget constraint V", u(ct)e~ rt dt = \~] VtG~ rt dt —$■ time budget \~] Ti — ^ ■ (^) 

Here, p is the pure rate of time preference and r is the market interest rate. In the standard 
program on the left, the rational agent aims at maximizing U = Ylt=i u(ct)e~ pt dt subject to 
the budget constraint written on the left side of (jSJ). In the program on the right, a rational 
agent attempts to maximize her utility U = Y2i=i u ( 1T i) derived from performing the tasks 
or occupations during a fixed time interval T, subject to the condition that the total time 
spent on these tasks (X)t=i r «) cannot be larger than T. 

C. Optimal time consumption leads to convergence to the critical point f3 — > 

The essential difference between the optimization program on the right compared to the 
standard consumption optimization on the left is that time is non-storable. Unused time 
is lost forever. Thus, the rational agent will try to find new occupations so as to increase 
the number of tasks in each time interval available to her. If she finds that her N tasks 
performed during time T consume ^2 i=1 Tj which is less than T, she will search for other 



tasks to increase her utility. This can take for instance the form of education (the analog 
of investing for future income yt) which provides novel opportunities of tasks in the future 
that will increase her future utility. She may also spend time exploring new opportunities, 
which is the analog of diversification of wealth investments for future consumption. In this 
way, over the long run, these strategies will make the agent tend to increase the utilization 
k of her time defined by 

N(T) 

1=1 

The time budget constraint (J5J) then reads k < 1. 

With the definition of the time deficit parameter (3 := (r) — (77), in terms of the difference 
between the average of the duration (r) of a task and the average waiting time (r/) between 
task arrivals, we have 

N(T) 



K) 



^E^> = ^> = fev = 1 + 7§' for^<l(/3<0), (7) 



T ^ " T^ ' (77) (r/) 



i=\ 



where we have used that (77) = T/N in the regime k < 1. A negative time deficit parameter 
/3 < corresponds to a sub-optimal utilization (k < 1). 

As the agent increases N for a fixed T, the utilization k will eventually reach the upper 
bound 1 associated with the time budget constraint (ED). As a consequence, the agent will 
not be able to increase her total utility anymore even by increasing N further, since only a 
subset of the N available tasks will be performed in the allotted time T. Worse, in this case, 
one needs to include a new term in the total utility of the agent, in order to account for 
the cost of task management that appears when the total burden of tasks increases above 
the time available to solve them. Indeed, in this case, the agent who faces more tasks or 
occupations that she can perform has to choose among all available tasks a subset that she 
will be able to address in her available time T. This takes the form of a cost, either just 
in the form of time used to manage and prioritize her tasks and of other resources spent to 
perform this management. Then, since the average utility E[C7] = E[^ i=1 u(tj)] saturates 
to a constant when ^2t=i Ti reac hes T, the total utility which includes the management cost 
when Ylt=i T i > T becomes a decreasing function of the number of tasks. 

Therefore, in order to reduce waste of time (resp. overstress due to accumulation of 
pending tasks), a rational agent in a stationary set-up will tend to adjust her time-deficit /3 
as close to zero as is possible, therefore maximizing her utility. As for geographies of cities 



or economies of firms [28|, |29[, the critical power law regime with a = 0.5 resulting from 
— 0, can be interpreted as the result of optimal allocation of resources that ensures the 
maximum sustainable use of time. 

We summarize the optimization process leading to the convergence to the critical point 
(3 = as follows. 

• for k < 1, the total utility is less than optimal and the agent will increase N/T by 
finding new tasks or occupations, in order to increase her utility; 

• when k reaches 1 and X]*=i r « > 1 ""> ner utility becomes a decreasing function of N/T 
due to management costs. 

• Hence, the optimal number of tasks is such that T/N := (rj) = (r), for which (3 = 
and the utilization k reaches marginally its upper bound 1. This boundary value for 
K corresponds to the critical value (3 = of the priority queueing theory, which is 
derived here as the fixed attractive point of a utility maximizing agent under general 
conditions. 

D. Extension to account for behavioral biases and heterogeneity 

However, humans are no perfect time utility maximizers and suffer from cognitive biases 
as well as emotional quirks. This can result in departures from the optimum (3 = 0, due 
to imperfect anticipations of new tasks and changes in the solving rate. Fig. 1 presents 
several possible scenarios on how the arrival of a given task may trigger a perturbation, thus 
changing (3{t) and the response Q(t). We capture these effects by modeling the dynamics 
of j3 by a mean-reverting process around some attractive point (3 close to the critical value 
(3 = 0. 



The simplest mean-reverting dynamics is the Ornstein-Uhlenbeck process [30|, which 
predicts that the distribution of /3's estimated over a large population of similar agents is a 
Gaussian law centered on (3 with a standard deviation (std) (3$ determined by the efficiency 
with which agents optimize their use of time. The parameter (3 quantifies the intrinsic 
propensity of the population of agents to under-use ((3 < 0) or over-stretch ((3 > 0) their 
time. The std (3q is proportional to the speed with which the agents adapt to changing 
conditions. Large /3 's correspond to a diffuse and weak response to perturbations and 
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changing conditions of incoming tasks. Under these conditions, the population of agents can 
be parameterized by the Gaussian distribution ip((3; (3, f3 ) 

W;ft A>) = ^=jFexp (- i(3 ~] )2 ) • (8) 



V^Fo V 2/3, 



In the presence of such a distribution of time deficit parameters (3, assuming indepen- 
dence between individuals with different /3's, the survival distribution Q(t) of the waiting 
times until completion of the target task is simply the average of the one-person survival 
distribution Q(t; f3) of waiting times between message and action for a specific /3, weighted 
by the population density i/j(/3; (3,(3o): 

r+oo 

Q(t; /3, f3 ) = / Q(t; /3)^(/3; /3, j3 )d(3 . (9) 



Using the form of Q(t; (3) previously obtained [20j] , expression ([9]) predicts different regimes 
that we now test on our dataset. 

III. EMPIRICAL RESULTS 

A. Data sets 

We consider a system where word-of-mouth effects and other social interactions are essen- 
tially absent, so as to test the generalized queueing theory as cleanly as possible. Specifically, 
we study the persistence of the use of outdated Web browsers (Firefox, Opera, Chrome and 
Safari) after users have been prompted to perform an update. Our data is obtained from 
anonymized daily log files of Google web servers (more than 70% of the worldwide daily 



searches), collected over more than three years |23|] (cf. the Appendix). 

The release of a new browser version is typically accompanied by an alert message no- 
tifying the user of the pending update. The message is delivered to all users at the same 
time and the update is performed at different times, if ever, by an heterogenous population 
of people. 

B. Analysis of data with the priority queuieing model 

Fig. 2 shows the time dependence of the fraction of the population who use a given 
Mozilla Firefox browser version. Each time a new version is released, the fraction of users 



using the outdated version drops initially very fast and then very slowly, in favor of the 
new version which has its market share increase inversely. Each version goes through a life 
cycle of fast increase of usage followed by a slow concave asymptotic regime, and then a 
sharp collapse at the time of the introduction of the next version, continuing into a slow 
asymptotic decay. Almost all versions of all browsers exhibit this characteristic behavior. 

Fig. 3a shows a few representative decays of browser use after the introduction of a 
new version, taken as the origin of time, and their fit by expression (Q. For each fit, the 



corresponding distribution ip((3; (3, fio) is shown in panel 3b (see 32[ for the fits of all 44 
browser versions that we have analyzed). 

• Mozilla Firefox 3.0.3 (orange circles and line) is well fitted by a simple power law with 
exponent a ~ 0.5, and the corresponding probability distribution function ip((3; @, (3q) 
is close to a Dirac function centered on 0. 

• For Mozilla Firefox 1.5.0.8 (green crosses and line), the decay is essentially exponential. 
Its corresponding ip((3;(3,f3o) lies in a narrow range of negative /3's (j3 < and (3 < 
|/3 1/3). This version is associated with a population under utilizing their time and 
with rapid responses to perturbations. 

• For Mozilla Firefox 1.5.0.11 (red diamonds and line), the power law decay with ex- 
ponent a ~ 0.9 ±0.1 is followed by an upward curvature, which can be interpreted 
as the convergence to a non-zero plateau at much larger times. The corresponding 
pdf ip((3] /3,/3o) is very broad with f3 < and |/3| ~ /3 , while the upward curvature is 
contributed by the fraction of positive /3's in the population. For this browser version, 
the generally quite fast upgrade is accompanied by a large heterogeneity of behaviors, 
which can be due to complications preventing a straightforward execution of the task. 
This version is again associated with a population under utilizing their time but with 
slow and weak adaptation to perturbations. 

• For Apple Safari 2 (SF2), (blue +'s and line), Q(t; /3, f3o) decays very slowly as a power 
law with exponent a ~ 0.3 ± 0.1, which qualifies the procrastination mode discussed 
with Fig. lc. The decay is followed by an upward curvature, again suggesting the 
convergence to a plateau quantifying the fraction of users who will never update their 
browser. 
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• For SF2, we show two decays following two successive announcements for this version 
update (October 26th and November 16th, 2007). These two versions are associated 
with a population over utilizing their time and with relatively rapid responses to 
perturbations. 

Figure 4 depicts the phase diagram of the different decay regimes in the plane (/3,/3 ). 
The crosses represent the best parameters fitted to the 44 browser versions updates that we 
have analyzed. The line (3 = —\f3 separates a lower region, in which more than 99.5% of /3's 
are negative and basically all users end up executing the target tasks, from an upper domain 
in which a significant fraction of the population has /3 > for which the decay saturates at 
a plateau giving the probability for the target task never to be completed. 

IV. CONCLUSION 

We have shown that the generalization of priority queueing models to encompass a het- 
erogenous and dynamically varying population of users provide a complete rationalization 
of the large variability of responses of individuals to messages. These results show that it 
is possible to measure and predict the efficiency of time allocation by humans, individually 
or collectively, in a particular or more general context. Our result open new perspective 
for understanding and modeling decision-making by humans in real life, but also effects in 



crowds 



25| , in the presence of social interactions and viral epidemics 
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Appendix: Data Collection 

We use data obtained by Google Switzerland from anonymized daily log files of Google's 



global web services including Google search 
campaigns and over more than three years 



www . google . com"]) performed in two distinct 



23 



24|. We only considered Apple Safari, 



Mozilla Firefox, Opera and Google Chrome. We had to exclude Microsoft Internet Ex- 
plorer because we could not retrieve all minor updates and it is used by large organizations 
(e.g. corporations) that apply specific update policies, which could significantly bias the re- 
sults. 

For each type of browser, we extracted the daily fraction of users for each version of 
this browser. Since Google searches account for more than 70% of the hundreds of millions 
of searches performed each day on the World Wide Web, our database provides a unique 
window to the dynamics of the use of browsers at the largest scale. Data have been collected 
by Google Inc. in two campaigns over two years: 

• Campaign 1 : from 20th December 2006 until 31st July 2008. 

• Campaign 2 : from 21th October 2008 until 17th April 2009. 

All browsers daily activity are recorded either twice a week before 10th October 2007 and 
after 20th December 2008, and every day between these two dates. 

When a browser communicates with a Web server, it usually sends the user-agent string 
in the HTTP protocol header with every request for a Web page. This string contains the 
type and version of the browser and the type of operating system the browser runs on. To 
ensure that an Internet user visiting several times Google within one day is not counted 
twice or more, each visit was linked with Google's PREF cookie to eliminate duplicate visits 
from the same browser. We ignored the small fraction of browsers that disabled cookies due 
to restrictive user settings. We also ignored the small possibility of cookie id collisions and 
the effect of users deleting cookies manually. There are also some proxies that change the 
user-agent string. Based on the observed dynamics of this research, we expect this effect to 
be small. Some hosts send fake "HTTP user agent" string for various reasons (web crawling, 
bots, etc.), but we can assume that this phenomenon is also marginal compared to the 75% 



of the World Wide Web traffic captured by Google at this time 
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23] 



Google has a strict confidentiality policy to protect users and all data are anonymized 
after six months. Additionally, we only had access to relative activity of browsers over time 
per browser. This means that the sum over all versions of each browser (resp. Chrome, 
Firefox, Opera or Safari) is equal to 1. Out of 309 browser version traces identified in 
the database only 44 had fully exploitable decays, namely exhibiting a fast drop due to an 
exogenous shock (the release of a newer version) followed by a decay over a sufficient period 
of time (between 60 and 600 days). 
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FIG. 1: Simulations of three typical strategies of browsers updates. The left panels show typical 
evolutions of the instantaneous time deficit parameter /3 associated with the queue of pending tasks 
as a function of time r, obtained by executing the simple priority queueing model. The arrow 
indicates the time of arrival of the target task, which is executed when (3 touches 0. The right 
panels display the distribution of waiting times Q{t) before the target task is executed obtained by 
simulating a population of 3000 individuals. The panels a, b and c correspond to three possible 
adaptation / reactions of users to the incoming target task. a. The arrival of the target task does 
not perturb the flow of task execution, b. The arrival of the target task triggers a stress (negative 
or positive); the target task(s) will be executed far much slower (bold line) (resp. faster (light line)) 
leading to a crossover towards a plateau (resp. exponential decay), c. Humans procrastinate, i.e. 
the target task is not necessarily executed even in absence of other tasks. The arrival of the target 
task can in addition increase (resp. decrease) the stress and its execution is less likely (bold line) 
(resp. more likely (light line)). 
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FIG. 2: Fraction of the population using a given Mozilla Firefox browser (minor) version as a 
function of time and over two data collection campaigns (cf. Appendix). The three major versions 
1.5.x, 2.x and 3.x are shown separately. 
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FIG. 3: a. Typical decays of the ccdf Q(t; /3, /?o) of waiting times till the update to the new version 
for Mozila Firefox 3.0.3 (green diagonal crosses and line), Mozilla Firefox 1.5.0.8 (yellow circles 
and line), Mozilla Firefox 1.5.0.11 (red diamonds and line) and Apple Safari 2 characterized by 



321 ] for all 44 fits). Points show the empirical 



two consecutive decays (blue crosses and lines). (See 

(bined) decays and lines represent the best fit. b. Corresponding Gaussian probability distribution 

function (pdf) i/j(/3;j3,(3o) of time deficit /3 defined by expression (jSJ). 
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FIG. 4: Phase diagram in the plane (/3,/3o) summarizing the different regimes for the decay of the 
ccdf Q(t; j3, /3o) of waiting times between message and update, predicted by expression Q with ©. 
The 44 +'s correspond to the best fits of the 44 data sets with expression ([9]). The line /3o ~ — 3/? 
separates the upper inefficient update region from the lower efficient update region. 
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